Site identification in high-throughput RNA-protein interaction data

Authors

  • Philip J. Uren
  • Emad Bahrami Samani
  • Suzanne C. Burns
  • Mei Qiao
  • Fedor V. Karginov
  • Emily Hodges
  • Gregory J. Hannon
  • Jeremy R. Sanford
  • Luiz O. F. Penalva
  • Andrew D. Smith
Abstract

MOTIVATION: Post-transcriptional and co-transcriptional regulation is a crucial link between genotype and phenotype. The central players are the RNA-binding proteins, and experimental technologies for probing their activities, such as cross-linking with immunoprecipitation (CLIP) and RIP-seq, have advanced rapidly over the past decade. Statistically robust, flexible computational methods for identifying binding sites from high-throughput immunoprecipitation assays, however, are largely lacking.

RESULTS: We introduce a method for site identification that provides four key advantages over previous methods: (i) it can be applied to all variations of CLIP and RIP-seq technologies; (ii) it accurately models the underlying read-count distributions; (iii) it allows external covariates, such as transcript abundance (which we demonstrate is highly correlated with read count), to inform the site identification process; and (iv) it allows direct comparison of site usage across cell types or conditions.

AVAILABILITY AND IMPLEMENTATION: We have implemented our method in a software tool called Piranha. Source code and binaries, licensed under the GNU General Public License (version 3), are freely available for download from http://smithlab.usc.edu.

CONTACT: [email protected]

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Related resources

Identification of RNA-binding sites in artemin based on docking energy landscapes and molecular dynamics simulation

There are questions concerning the functions of artemin, an abundant stress protein found in Artemia during embryo development. It has been reported that artemin binds RNA at high temperatures in vitro, suggesting an RNA protective role. In this study, we investigated the possibility of the presence of RNA-binding sites and their structural properties in artemin, using docking energy ...

Analysis of sequencing data for probing RNA secondary structures and protein-RNA binding in studying post-transcriptional regulations

High-throughput sequencing has been used to study post-transcriptional regulations, where the identification of protein-RNA binding is a major and fast-developing sub-area, which is in turn benefited by the sequencing methods for whole-transcriptome probing of RNA secondary structures. In the study of RNA secondary structures using high-throughput sequencing, bases are modified or cleaved accor...

HTS-Net: An integrated regulome-interactome approach for establishing network regulation models in high-throughput screenings

High-throughput RNAi screenings (HTS) allow quantifying the impact of the deletion of each gene in any particular function, from virus-host interactions to cell differentiation. However, there has been less development for functional analysis tools dedicated to RNAi analyses. HTS-Net, a network-based analysis program, was developed to identify gene regulatory modules impacted in high-throughput...

Identifying Protein Complexes in High-Throughput Protein Interaction Screens Using an Infinite Latent Feature Model

We propose a Bayesian approach to identify protein complexes and their constituents from high-throughput protein-protein interaction screens. An infinite latent feature model that allows for multi-complex membership by individual proteins is coupled with a graph diffusion kernel that evaluates the likelihood of two proteins belonging to the same complex. Gibbs sampling is then used to infer a c...

A structured outputs method for predicting protein binding sites

Protein-protein interactions have essential roles in nearly all biochemical processes. While high-throughput methods exist for experimentally identifying interaction partners, the task of determining binding site locations remains arduous. We consider the prediction of protein binding site location as an instance of the label sequence problem and outline a representation in the framework of str...

Journal title:
  • Bioinformatics

Volume 28, Issue 23

Pages: -

Publication date: 2012